Improved Vector Quantization Approach for Discrete HMM Speech Recognition System
نویسندگان
چکیده
The paper presents an improved Vector Quantization (VQ) approach for discrete Hidden Markov Models (HMMs). This improved VQ approach performs an optimal distribution of VQ codebook components on HMM states. This technique, that we named the Distributed Vector Quantization (DVQ) of hidden Markov models, succeeds in unifying acoustic microstructure and phonetic macro-structure when the estimation of HMM parameters is performed. The DVQ technique is implemented through two variants; the first variant uses the K-means algorithm (K-means-DVQ) to optimize the VQ, while the second variant exploits the benefits of the classification behavior of Neural Networks (NN-DVQ) for the same purpose. The proposed variants are compared with the HMM-based baseline system by experiments of specific Arabic consonants recognition. The results show that the distributed vector quantization technique increase the performance of the discrete HMM system while maintaining the decoding speed of the models.
منابع مشابه
A New Vector Quantization Front-End Process for Discrete HMM Speech Recognition System
The paper presents a complete discrete statistical framework, based on a novel vector quantization (VQ) front-end process. This new VQ approach performs an optimal distribution of VQ codebook components on HMM states. This technique that we named the distributed vector quantization (DVQ) of hidden Markov models, succeeds in unifying acoustic micro-structure and phonetic macro-structure, when th...
متن کاملDiscrete-Mixture HMMs-based Approach for Noisy Speech Recognition
It is well known that the application of hidden Markov models (HMMs) has led to a dramatic increase of the performance of automatic speech recognition in the 1980s and from that time onwards. In particular, large vocabulary continuous speech recognition (LVCSR) could be realized by using a recognition unit such as phones. A variety of speech characteristics can be modelled by using HMMs effecti...
متن کاملSpeaker adaptation using regularization and network adaptation for hybrid MMI-NN/HMM speech recognition
This paper describes, how to perform speaker adaptation for a hybrid large vocabulary speech recognition system. The hybrid system is based on a Maximum Mutual Information Neural Network (MMINN), which is used as a Vector Quantizer (VQ) for a discrete HMM speech recognizer. The combination of MMINNs and HMMs has shown good performance on several large vocabulary speech recognition tasks like RM...
متن کاملA continuous density interpretation of discrete HMM systems and MMI-neural networks
The subject of this paper is the integration of the traditional vector quantizer (VQ) and discrete hidden Markov models (HMM) combination in the mixture emission density framework commonly used in automatic speech recognition (ASR). It is shown that the probability density of a system that consists of a VQ and a discrete classifier can be interpreted as a special case of a semicontinuous mixtur...
متن کاملPerformance of hybrid MMI-connectionist/HMM systems on the WSJ speech database
In this paper, a hybrid MMI-connectionist / hidden Markov model (HMM) speech recognition system for the Wall Street Journal (WSJ) database is presented. The HMM part of this system uses discrete probability density functions (pdf). The neural network (NN) is used to replace a classical vector quantizer (VQ) like a k-means or LBG algorithm, which are typically used in discrete HMM systems. The N...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Int. Arab J. Inf. Technol.
دوره 4 شماره
صفحات -
تاریخ انتشار 2007